Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 3312 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 543.5 KiB |
| Average record size in memory | 168.0 B |
Variable types
| Numeric | 6 |
|---|---|
| Categorical | 14 |
| DateTime | 1 |
Alerts
| Alert | Type |
|---|---|
| Country has constant value "United States" | Constant |
| Order ID has a high cardinality: 1687 distinct values | High cardinality |
| Customer ID has a high cardinality: 693 distinct values | High cardinality |
| Customer Name has a high cardinality: 693 distinct values | High cardinality |
| City has a high cardinality: 350 distinct values | High cardinality |
| Product ID has a high cardinality: 1525 distinct values | High cardinality |
| Product Name has a high cardinality: 1511 distinct values | High cardinality |
| Sales is highly correlated with Profit | High correlation |
| Discount is highly correlated with Profit | High correlation |
| Profit is highly correlated with Sales and 1 other fields | High correlation |
| Sales is highly correlated with Profit | High correlation |
| Profit is highly correlated with Sales | High correlation |
| month_year is highly correlated with Country | High correlation |
| Segment is highly correlated with Country | High correlation |
| Ship Mode is highly correlated with Country | High correlation |
| Country is highly correlated with month_year and 6 other fields | High correlation |
| Region is highly correlated with Country and 1 other fields | High correlation |
| State is highly correlated with Country and 1 other fields | High correlation |
| Sub-Category is highly correlated with Country and 1 other fields | High correlation |
| Category is highly correlated with Country and 1 other fields | High correlation |
| State is highly correlated with Postal Code and 2 other fields | High correlation |
| Postal Code is highly correlated with State and 2 other fields | High correlation |
| Region is highly correlated with State and 1 other fields | High correlation |
| Category is highly correlated with Sub-Category and 1 other fields | High correlation |
| Sub-Category is highly correlated with Category and 1 other fields | High correlation |
| Sales is highly correlated with Profit | High correlation |
| Discount is highly correlated with State and 3 other fields | High correlation |
| Profit is highly correlated with Sales | High correlation |
| Product ID is uniformly distributed | Uniform |
| Product Name is uniformly distributed | Uniform |
| Row ID has unique values | Unique |
| Discount has 1590 (48.0%) zeros | Zeros |
Reproduction
| Analysis started | 2022-03-16 18:05:03.487870 |
|---|---|
| Analysis finished | 2022-03-16 18:07:05.998742 |
| Duration | 2 minutes and 2.51 seconds |
| Software version | pandas-profiling v3.1.1 |
| Download configuration | config.json |
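The reported duration can be cross-checked against the two timestamps above. The following is a minimal sketch using only the Python standard library; the variable names are illustrative and not part of the report.

```python
from datetime import datetime

# Timestamps copied from the Reproduction table.
started = datetime.fromisoformat("2022-03-16 18:05:03.487870")
finished = datetime.fromisoformat("2022-03-16 18:07:05.998742")

# Elapsed wall-clock time, split into whole minutes and remaining seconds.
elapsed = finished - started
minutes, seconds = divmod(elapsed.total_seconds(), 60)

print(f"{int(minutes)} minutes and {seconds:.2f} seconds")
```

The result agrees with the "2 minutes and 2.51 seconds" shown in the Duration row.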
Row ID
| Distinct | 3312 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 5087.107488 |
| Minimum | 13 |
|---|---|
| Maximum | 9994 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 13 |
|---|---|
| 5-th percentile | 643.55 |
| Q1 | 2655.75 |
| median | 5183.5 |
| Q3 | 7498.25 |
| 95-th percentile | 9471.45 |
| Maximum | 9994 |
| Range | 9981 |
| Interquartile range (IQR) | 4842.5 |
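Range and interquartile range follow directly from the quantiles listed above. A short sketch of the arithmetic, with the values copied from the table and variable names chosen only for illustration:

```python
# Quantile values copied from the table above.
minimum, maximum = 13, 9994
q1, q3 = 2655.75, 7498.25

# Range is the spread between the extremes; IQR is the spread of the middle 50%.
value_range = maximum - minimum
iqr = q3 - q1

print(value_range, iqr)  # 9981 4842.5
```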
Descriptive statistics
| Standard deviation | 2817.482266 |
|---|---|
| Coefficient of variation (CV) | 0.5538475986 |
| Kurtosis | -1.180921704 |
| Mean | 5087.107488 |
| Median Absolute Deviation (MAD) | 2409.5 |
| Skewness | -0.01617289766 |
| Sum | 16848500 |
| Variance | 7938206.321 |
| Monotonicity | Strictly increasing |
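Several of the descriptive statistics above are related by simple identities: CV is the standard deviation divided by the mean, variance is the squared standard deviation, and the sum is the mean times the number of observations. The sketch below checks these relations against the reported (rounded) figures, so it compares with a tolerance rather than exactly.

```python
import math

# Values copied from the report; n is the number of observations.
n = 3312
mean = 5087.107488
std = 2817.482266

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance is the square of the standard deviation
total = mean * n       # sum of all values

# The inputs are rounded, so compare against the report with a tolerance.
print(math.isclose(cv, 0.5538475986, rel_tol=1e-6))
print(math.isclose(variance, 7938206.321, rel_tol=1e-6))
print(math.isclose(total, 16848500, rel_tol=1e-6))
```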
| Value | Count | Frequency (%) |
| 13 | 1 | < 0.1% |
| 6694 | 1 | < 0.1% |
| 6671 | 1 | < 0.1% |
| 6680 | 1 | < 0.1% |
| 6681 | 1 | < 0.1% |
| 6682 | 1 | < 0.1% |
| 6683 | 1 | < 0.1% |
| 6689 | 1 | < 0.1% |
| 6690 | 1 | < 0.1% |
| 6691 | 1 | < 0.1% |
| Other values (3302) | 3302 |
| Value | Count | Frequency (%) |
| 13 | 1 | |
| 24 | 1 | |
| 35 | 1 | |
| 42 | 1 | |
| 44 | 1 | |
| 72 | 1 | |
| 76 | 1 | |
| 77 | 1 | |
| 78 | 1 | |
| 85 | 1 |
| Value | Count | Frequency (%) |
| 9994 | 1 | |
| 9993 | 1 | |
| 9992 | 1 | |
| 9991 | 1 | |
| 9989 | 1 | |
| 9988 | 1 | |
| 9982 | 1 | |
| 9970 | 1 | |
| 9969 | 1 | |
| 9968 | 1 |
Order ID
| Distinct | 1687 |
|---|---|
| Distinct (%) | 50.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| CA-2017-100111 | 14 |
|---|---|
| CA-2017-157987 | 12 |
| CA-2017-140949 | 9 |
| CA-2017-117457 | 9 |
| CA-2017-156776 | 8 |
| Other values (1682) |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Characters and Unicode
| Total characters | 46368 |
|---|---|
| Distinct characters | 15 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 875 |
|---|---|
| Unique (%) | 26.4% |
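The figures here suggest that "Distinct" counts the different values present (1687), while "Unique" counts only the values that occur exactly once (875). The sketch below illustrates that reading on a small made-up list of order IDs; the list is a toy example and not taken from the dataset as a whole.

```python
from collections import Counter

# Toy sample (hypothetical): two IDs repeat, two appear only once.
order_ids = [
    "CA-2017-100111", "CA-2017-100111",
    "CA-2017-157987", "US-2017-156909",
    "CA-2017-157987", "CA-2017-114412",
]

counts = Counter(order_ids)

# Distinct: how many different values exist at all.
distinct = len(counts)
# Unique: how many values occur exactly once.
unique = sum(1 for c in counts.values() if c == 1)

print(distinct, unique)  # 4 2
```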
Sample
| 1st row | CA-2017-114412 |
|---|---|
| 2nd row | US-2017-156909 |
| 3rd row | CA-2017-107727 |
| 4th row | CA-2017-120999 |
| 5th row | CA-2017-139619 |
Common Values
| Value | Count | Frequency (%) |
| CA-2017-100111 | 14 | 0.4% |
| CA-2017-157987 | 12 | 0.4% |
| CA-2017-140949 | 9 | 0.3% |
| CA-2017-117457 | 9 | 0.3% |
| CA-2017-156776 | 8 | 0.2% |
| CA-2017-118017 | 8 | 0.2% |
| CA-2017-140872 | 8 | 0.2% |
| CA-2017-102925 | 8 | 0.2% |
| CA-2017-110905 | 8 | 0.2% |
| CA-2017-113278 | 8 | 0.2% |
| Other values (1677) | 3220 |
Words
| Value | Count | Frequency (%) |
| ca-2017-100111 | 14 | 0.4% |
| ca-2017-157987 | 12 | 0.4% |
| ca-2017-140949 | 9 | 0.3% |
| ca-2017-117457 | 9 | 0.3% |
| ca-2017-110905 | 8 | 0.2% |
| us-2017-118087 | 8 | 0.2% |
| ca-2017-161956 | 8 | 0.2% |
| ca-2017-164756 | 8 | 0.2% |
| ca-2017-113278 | 8 | 0.2% |
| ca-2017-102925 | 8 | 0.2% |
| Other values (1677) | 3220 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 8448 | |
| - | 6624 | |
| 0 | 5199 | |
| 2 | 5169 | |
| 7 | 4664 | |
| C | 2732 | 5.9% |
| A | 2732 | 5.9% |
| 4 | 1809 | 3.9% |
| 6 | 1769 | 3.8% |
| 3 | 1739 | 3.8% |
| Other values (5) | 5483 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 33120 | |
| Dash Punctuation | 6624 | 14.3% |
| Uppercase Letter | 6624 | 14.3% |
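The category counts above can be reproduced from a single ID, since every value has length 14 and follows the same two-letter, four-digit, six-digit pattern. This sketch uses Python's standard `unicodedata` module, which reports general categories as two-letter codes (`Nd` for Decimal Number, `Pd` for Dash Punctuation, `Lu` for Uppercase Letter); the assumption that all 3312 values share this pattern is inferred from the length statistics.

```python
import unicodedata
from collections import Counter

# One Order ID taken from the report's sample values.
sample = "CA-2017-100111"

# Count characters by Unicode general category.
categories = Counter(unicodedata.category(ch) for ch in sample)

# Scale to the whole column, assuming all 3312 values share the pattern.
n_rows = 3312
per_column = {cat: count * n_rows for cat, count in categories.items()}

print(per_column)
```

Under that assumption the totals come to 33120 decimal digits, 6624 dashes and 6624 uppercase letters, matching the table.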
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8448 | |
| 0 | 5199 | |
| 2 | 5169 | |
| 7 | 4664 | |
| 4 | 1809 | 5.5% |
| 6 | 1769 | 5.3% |
| 3 | 1739 | 5.3% |
| 5 | 1673 | 5.1% |
| 8 | 1344 | 4.1% |
| 9 | 1306 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2732 | |
| A | 2732 | |
| U | 580 | 8.8% |
| S | 580 | 8.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6624 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 39744 | |
| Latin | 6624 | 14.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 8448 | |
| - | 6624 | |
| 0 | 5199 | |
| 2 | 5169 | |
| 7 | 4664 | |
| 4 | 1809 | 4.6% |
| 6 | 1769 | 4.5% |
| 3 | 1739 | 4.4% |
| 5 | 1673 | 4.2% |
| 8 | 1344 | 3.4% |
Latin
| Value | Count | Frequency (%) |
| C | 2732 | |
| A | 2732 | |
| U | 580 | 8.8% |
| S | 580 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 46368 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 8448 | |
| - | 6624 | |
| 0 | 5199 | |
| 2 | 5169 | |
| 7 | 4664 | |
| C | 2732 | 5.9% |
| A | 2732 | 5.9% |
| 4 | 1809 | 3.9% |
| 6 | 1769 | 3.8% |
| 3 | 1739 | 3.8% |
| Other values (5) | 5483 |
Order Date
Date
| Distinct | 322 |
|---|---|
| Distinct (%) | 9.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Minimum | 2020-01-01 00:00:00 |
|---|---|
| Maximum | 2020-12-30 00:00:00 |
Ship Mode
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Standard Class | |
|---|---|
| Second Class | |
| First Class | |
| Same Day | 186 |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 12.74818841 |
| Min length | 8 |
Characters and Unicode
| Total characters | 42222 |
|---|---|
| Distinct characters | 18 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Standard Class |
|---|---|
| 2nd row | Second Class |
| 3rd row | Second Class |
| 4th row | Standard Class |
| 5th row | Standard Class |
Common Values
| Value | Count | Frequency (%) |
| Standard Class | 1897 | |
| Second Class | 657 | 19.8% |
| First Class | 572 | 17.3% |
| Same Day | 186 | 5.6% |
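The length statistics for this variable can be derived from the counts in the table above: the total character count is the sum of each value's length times its count, and the mean length is that total divided by the number of rows. A minimal sketch, with the counts copied from the table:

```python
# Value counts copied from the Common Values table.
counts = {
    "Standard Class": 1897,
    "Second Class": 657,
    "First Class": 572,
    "Same Day": 186,
}

n = sum(counts.values())
# Each value contributes len(value) characters per occurrence.
total_chars = sum(len(value) * count for value, count in counts.items())
mean_length = total_chars / n

print(n, total_chars, round(mean_length, 8))
```

This gives 3312 rows, 42222 characters in total and a mean length of about 12.748, in line with the Length and Characters and Unicode tables.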
Words
| Value | Count | Frequency (%) |
| class | 3126 | |
| standard | 1897 | |
| second | 657 | 9.9% |
| first | 572 | 8.6% |
| same | 186 | 2.8% |
| day | 186 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7292 | |
| s | 6824 | |
| d | 4451 | |
| (space) | 3312 | |
| l | 3126 | |
| C | 3126 | |
| S | 2740 | 6.5% |
| n | 2554 | 6.0% |
| r | 2469 | 5.8% |
| t | 2469 | 5.8% |
| Other values (8) | 3859 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32286 | |
| Uppercase Letter | 6624 | 15.7% |
| Space Separator | 3312 | 7.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7292 | |
| s | 6824 | |
| d | 4451 | |
| l | 3126 | |
| n | 2554 | 7.9% |
| r | 2469 | 7.6% |
| t | 2469 | 7.6% |
| e | 843 | 2.6% |
| c | 657 | 2.0% |
| o | 657 | 2.0% |
| Other values (3) | 944 | 2.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 3126 | |
| S | 2740 | |
| F | 572 | 8.6% |
| D | 186 | 2.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 38910 | |
| Common | 3312 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7292 | |
| s | 6824 | |
| d | 4451 | |
| l | 3126 | |
| C | 3126 | |
| S | 2740 | 7.0% |
| n | 2554 | 6.6% |
| r | 2469 | 6.3% |
| t | 2469 | 6.3% |
| e | 843 | 2.2% |
| Other values (7) | 3016 |
Common
| Value | Count | Frequency (%) |
| (space) | 3312 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42222 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7292 | |
| s | 6824 | |
| d | 4451 | |
| (space) | 3312 | |
| l | 3126 | |
| C | 3126 | |
| S | 2740 | 6.5% |
| n | 2554 | 6.0% |
| r | 2469 | 5.8% |
| t | 2469 | 5.8% |
| Other values (8) | 3859 |
Customer ID
| Distinct | 693 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| SV-20365 | 20 |
|---|---|
| JL-15835 | 20 |
| Dp-13240 | 19 |
| MH-18115 | 19 |
| LC-16870 | 17 |
| Other values (688) |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 8 |
| Min length | 8 |
Characters and Unicode
| Total characters | 26496 |
|---|---|
| Distinct characters | 39 |
| Distinct categories | 4 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 96 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | AA-10480 |
|---|---|
| 2nd row | SF-20065 |
| 3rd row | MA-17560 |
| 4th row | LC-16930 |
| 5th row | ES-14080 |
Common Values
| Value | Count | Frequency (%) |
| SV-20365 | 20 | 0.6% |
| JL-15835 | 20 | 0.6% |
| Dp-13240 | 19 | 0.6% |
| MH-18115 | 19 | 0.6% |
| LC-16870 | 17 | 0.5% |
| SS-20140 | 16 | 0.5% |
| AC-10615 | 16 | 0.5% |
| JM-15250 | 15 | 0.5% |
| EP-13915 | 15 | 0.5% |
| DS-13030 | 15 | 0.5% |
| Other values (683) | 3140 |
Words
| Value | Count | Frequency (%) |
| sv-20365 | 20 | 0.6% |
| jl-15835 | 20 | 0.6% |
| dp-13240 | 19 | 0.6% |
| mh-18115 | 19 | 0.6% |
| lc-16870 | 17 | 0.5% |
| ss-20140 | 16 | 0.5% |
| ac-10615 | 16 | 0.5% |
| jm-15250 | 15 | 0.5% |
| ep-13915 | 15 | 0.5% |
| ds-13030 | 15 | 0.5% |
| Other values (683) | 3140 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 4014 | |
| - | 3312 | |
| 0 | 2857 | 10.8% |
| 5 | 2602 | 9.8% |
| 2 | 1533 | 5.8% |
| 8 | 998 | 3.8% |
| 3 | 993 | 3.7% |
| 6 | 921 | 3.5% |
| 9 | 890 | 3.4% |
| 4 | 878 | 3.3% |
| Other values (29) | 7498 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16560 | |
| Uppercase Letter | 6603 | 24.9% |
| Dash Punctuation | 3312 | 12.5% |
| Lowercase Letter | 21 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 610 | 9.2% |
| C | 574 | 8.7% |
| B | 552 | 8.4% |
| M | 528 | 8.0% |
| D | 465 | 7.0% |
| J | 409 | 6.2% |
| A | 385 | 5.8% |
| H | 343 | 5.2% |
| P | 330 | 5.0% |
| R | 299 | 4.5% |
| Other values (16) | 2108 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 4014 | |
| 0 | 2857 | |
| 5 | 2602 | |
| 2 | 1533 | 9.3% |
| 8 | 998 | 6.0% |
| 3 | 993 | 6.0% |
| 6 | 921 | 5.6% |
| 9 | 890 | 5.4% |
| 4 | 878 | 5.3% |
| 7 | 874 | 5.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 19 | |
| l | 2 | 9.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 19872 | |
| Latin | 6624 | 25.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 610 | 9.2% |
| C | 574 | 8.7% |
| B | 552 | 8.3% |
| M | 528 | 8.0% |
| D | 465 | 7.0% |
| J | 409 | 6.2% |
| A | 385 | 5.8% |
| H | 343 | 5.2% |
| P | 330 | 5.0% |
| R | 299 | 4.5% |
| Other values (18) | 2129 |
Common
| Value | Count | Frequency (%) |
| 1 | 4014 | |
| - | 3312 | |
| 0 | 2857 | |
| 5 | 2602 | |
| 2 | 1533 | 7.7% |
| 8 | 998 | 5.0% |
| 3 | 993 | 5.0% |
| 6 | 921 | 4.6% |
| 9 | 890 | 4.5% |
| 4 | 878 | 4.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 26496 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 4014 | |
| - | 3312 | |
| 0 | 2857 | 10.8% |
| 5 | 2602 | 9.8% |
| 2 | 1533 | 5.8% |
| 8 | 998 | 3.8% |
| 3 | 993 | 3.7% |
| 6 | 921 | 3.5% |
| 9 | 890 | 3.4% |
| 4 | 878 | 3.3% |
| Other values (29) | 7498 |
Customer Name
| Distinct | 693 |
|---|---|
| Distinct (%) | 20.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Seth Vernon | 20 |
|---|---|
| John Lee | 20 |
| Dean percer | 19 |
| Mick Hernandez | 19 |
| Lena Cacioppo | 17 |
| Other values (688) |
Length
| Max length | 22 |
|---|---|
| Median length | 13 |
| Mean length | 12.9794686 |
| Min length | 7 |
Characters and Unicode
| Total characters | 42988 |
|---|---|
| Distinct characters | 56 |
| Distinct categories | 5 |
| Distinct scripts | 2 |
| Distinct blocks | 2 |
Unique
| Unique | 96 |
|---|---|
| Unique (%) | 2.9% |
Sample
| 1st row | Andrew Allen |
|---|---|
| 2nd row | Sandra Flanagan |
| 3rd row | Matt Abelman |
| 4th row | Linda Cazamias |
| 5th row | Erin Smith |
Common Values
| Value | Count | Frequency (%) |
| Seth Vernon | 20 | 0.6% |
| John Lee | 20 | 0.6% |
| Dean percer | 19 | 0.6% |
| Mick Hernandez | 19 | 0.6% |
| Lena Cacioppo | 17 | 0.5% |
| Saphhira Shifley | 16 | 0.5% |
| Ann Chong | 16 | 0.5% |
| Janet Martin | 15 | 0.5% |
| Emily Phan | 15 | 0.5% |
| Darrin Sayre | 15 | 0.5% |
| Other values (683) | 3140 |
Words
| Value | Count | Frequency (%) |
| frank | 49 | 0.7% |
| patrick | 42 | 0.6% |
| john | 41 | 0.6% |
| michael | 39 | 0.6% |
| ann | 38 | 0.6% |
| bill | 36 | 0.5% |
| alan | 34 | 0.5% |
| rick | 33 | 0.5% |
| mick | 31 | 0.5% |
| dean | 30 | 0.5% |
| Other values (829) | 6281 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3965 | 9.2% |
| e | 3905 | 9.1% |
| n | 3495 | 8.1% |
| (space) | 3342 | 7.8% |
| r | 3111 | 7.2% |
| i | 2632 | 6.1% |
| l | 2124 | 4.9% |
| o | 2024 | 4.7% |
| t | 1738 | 4.0% |
| s | 1541 | 3.6% |
| Other values (46) | 15111 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 32760 | |
| Uppercase Letter | 6814 | 15.9% |
| Space Separator | 3342 | 7.8% |
| Other Punctuation | 62 | 0.1% |
| Dash Punctuation | 10 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3965 | |
| e | 3905 | |
| n | 3495 | |
| r | 3111 | |
| i | 2632 | 8.0% |
| l | 2124 | 6.5% |
| o | 2024 | 6.2% |
| t | 1738 | 5.3% |
| s | 1541 | 4.7% |
| h | 1306 | 4.0% |
| Other values (17) | 6919 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 620 | 9.1% |
| S | 610 | 9.0% |
| B | 573 | 8.4% |
| M | 550 | 8.1% |
| D | 482 | 7.1% |
| J | 409 | 6.0% |
| A | 400 | 5.9% |
| H | 358 | 5.3% |
| P | 330 | 4.8% |
| R | 310 | 4.5% |
| Other values (16) | 2172 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3342 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 62 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39574 | |
| Common | 3414 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3965 | 10.0% |
| e | 3905 | 9.9% |
| n | 3495 | 8.8% |
| r | 3111 | 7.9% |
| i | 2632 | 6.7% |
| l | 2124 | 5.4% |
| o | 2024 | 5.1% |
| t | 1738 | 4.4% |
| s | 1541 | 3.9% |
| h | 1306 | 3.3% |
| Other values (43) | 13733 |
Common
| Value | Count | Frequency (%) |
| (space) | 3342 | |
| ' | 62 | 1.8% |
| - | 10 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42967 | |
| None | 21 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3965 | 9.2% |
| e | 3905 | 9.1% |
| n | 3495 | 8.1% |
| (space) | 3342 | 7.8% |
| r | 3111 | 7.2% |
| i | 2632 | 6.1% |
| l | 2124 | 4.9% |
| o | 2024 | 4.7% |
| t | 1738 | 4.0% |
| s | 1541 | 3.6% |
| Other values (44) | 15090 |
None
| Value | Count | Frequency (%) |
| ö | 18 | |
| ü | 3 | 14.3% |
Segment
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Consumer | |
|---|---|
| Corporate | |
| Home Office |
Length
| Max length | 11 |
|---|---|
| Median length | 8 |
| Mean length | 8.897342995 |
| Min length | 8 |
Characters and Unicode
| Total characters | 29468 |
|---|---|
| Distinct characters | 17 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Consumer |
|---|---|
| 2nd row | Consumer |
| 3rd row | Home Office |
| 4th row | Corporate |
| 5th row | Corporate |
Common Values
| Value | Count | Frequency (%) |
| Consumer | 1668 | |
| Corporate | 980 | |
| Home Office | 664 | 20.0% |
Words
| Value | Count | Frequency (%) |
| consumer | 1668 | |
| corporate | 980 | |
| home | 664 | 16.7% |
| office | 664 | 16.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4292 | |
| e | 3976 | |
| r | 3628 | |
| C | 2648 | |
| m | 2332 | |
| n | 1668 | 5.7% |
| s | 1668 | 5.7% |
| u | 1668 | 5.7% |
| f | 1328 | 4.5% |
| t | 980 | 3.3% |
| Other values (7) | 5280 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24828 | |
| Uppercase Letter | 3976 | 13.5% |
| Space Separator | 664 | 2.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4292 | |
| e | 3976 | |
| r | 3628 | |
| m | 2332 | |
| n | 1668 | 6.7% |
| s | 1668 | 6.7% |
| u | 1668 | 6.7% |
| f | 1328 | 5.3% |
| t | 980 | 3.9% |
| p | 980 | 3.9% |
| Other values (3) | 2308 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 2648 | |
| H | 664 | 16.7% |
| O | 664 | 16.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 664 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28804 | |
| Common | 664 | 2.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 4292 | |
| e | 3976 | |
| r | 3628 | |
| C | 2648 | |
| m | 2332 | |
| n | 1668 | 5.8% |
| s | 1668 | 5.8% |
| u | 1668 | 5.8% |
| f | 1328 | 4.6% |
| t | 980 | 3.4% |
| Other values (6) | 4616 |
Common
| Value | Count | Frequency (%) |
| (space) | 664 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 29468 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4292 | |
| e | 3976 | |
| r | 3628 | |
| C | 2648 | |
| m | 2332 | |
| n | 1668 | 5.7% |
| s | 1668 | 5.7% |
| u | 1668 | 5.7% |
| f | 1328 | 4.5% |
| t | 980 | 3.3% |
| Other values (7) | 5280 |
Country
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| United States |
|---|
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Characters and Unicode
| Total characters | 43056 |
|---|---|
| Distinct characters | 10 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | United States |
| 3rd row | United States |
| 4th row | United States |
| 5th row | United States |
Common Values
| Value | Count | Frequency (%) |
| United States | 3312 |
Words
| Value | Count | Frequency (%) |
| united | 3312 | |
| states | 3312 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 9936 | |
| e | 6624 | |
| U | 3312 | 7.7% |
| n | 3312 | 7.7% |
| i | 3312 | 7.7% |
| d | 3312 | 7.7% |
| (space) | 3312 | 7.7% |
| S | 3312 | 7.7% |
| a | 3312 | 7.7% |
| s | 3312 | 7.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 33120 | |
| Uppercase Letter | 6624 | 15.4% |
| Space Separator | 3312 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 9936 | |
| e | 6624 | |
| n | 3312 | 10.0% |
| i | 3312 | 10.0% |
| d | 3312 | 10.0% |
| a | 3312 | 10.0% |
| s | 3312 | 10.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 3312 | |
| S | 3312 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 3312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 39744 | |
| Common | 3312 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 9936 | |
| e | 6624 | |
| U | 3312 | 8.3% |
| n | 3312 | 8.3% |
| i | 3312 | 8.3% |
| d | 3312 | 8.3% |
| S | 3312 | 8.3% |
| a | 3312 | 8.3% |
| s | 3312 | 8.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 3312 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43056 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 9936 | |
| e | 6624 | |
| U | 3312 | 7.7% |
| n | 3312 | 7.7% |
| i | 3312 | 7.7% |
| d | 3312 | 7.7% |
| (space) | 3312 | 7.7% |
| S | 3312 | 7.7% |
| a | 3312 | 7.7% |
| s | 3312 | 7.7% |
City
| Distinct | 350 |
|---|---|
| Distinct (%) | 10.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| New York City | |
|---|---|
| Los Angeles | 210 |
| San Francisco | 190 |
| Seattle | 182 |
| Philadelphia | 182 |
| Other values (345) |
Length
| Max length | 16 |
|---|---|
| Median length | 9 |
| Mean length | 9.317934783 |
| Min length | 4 |
Characters and Unicode
| Total characters | 30861 |
|---|---|
| Distinct characters | 50 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 86 |
|---|---|
| Unique (%) | 2.6% |
Sample
| 1st row | Concord |
|---|---|
| 2nd row | Philadelphia |
| 3rd row | Houston |
| 4th row | Naperville |
| 5th row | Melbourne |
Common Values
| Value | Count | Frequency (%) |
| New York City | 306 | 9.2% |
| Los Angeles | 210 | 6.3% |
| San Francisco | 190 | 5.7% |
| Seattle | 182 | 5.5% |
| Philadelphia | 182 | 5.5% |
| Chicago | 114 | 3.4% |
| Houston | 104 | 3.1% |
| Columbus | 82 | 2.5% |
| Dallas | 70 | 2.1% |
| Jacksonville | 45 | 1.4% |
| Other values (340) | 1827 |
Words
| Value | Count | Frequency (%) |
| city | 327 | 7.1% |
| new | 314 | 6.8% |
| york | 308 | 6.7% |
| san | 246 | 5.3% |
| los | 210 | 4.5% |
| angeles | 210 | 4.5% |
| francisco | 190 | 4.1% |
| seattle | 182 | 3.9% |
| philadelphia | 182 | 3.9% |
| chicago | 114 | 2.5% |
| Other values (371) | 2340 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2941 | 9.5% |
| a | 2553 | 8.3% |
| o | 2391 | 7.7% |
| l | 2098 | 6.8% |
| i | 2094 | 6.8% |
| n | 1997 | 6.5% |
| t | 1522 | 4.9% |
| s | 1508 | 4.9% |
| r | 1446 | 4.7% |
| (space) | 1311 | 4.2% |
| Other values (40) | 11000 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 24927 | |
| Uppercase Letter | 4623 | 15.0% |
| Space Separator | 1311 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2941 | |
| a | 2553 | |
| o | 2391 | |
| l | 2098 | 8.4% |
| i | 2094 | 8.4% |
| n | 1997 | 8.0% |
| t | 1522 | 6.1% |
| s | 1508 | 6.0% |
| r | 1446 | 5.8% |
| c | 844 | 3.4% |
| Other values (15) | 5533 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 707 | |
| S | 586 | |
| L | 392 | |
| N | 365 | |
| A | 347 | 7.5% |
| P | 340 | 7.4% |
| Y | 313 | 6.8% |
| F | 311 | 6.7% |
| D | 188 | 4.1% |
| M | 174 | 3.8% |
| Other values (14) | 900 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1311 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29550 | |
| Common | 1311 | 4.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2941 | 10.0% |
| a | 2553 | 8.6% |
| o | 2391 | 8.1% |
| l | 2098 | 7.1% |
| i | 2094 | 7.1% |
| n | 1997 | 6.8% |
| t | 1522 | 5.2% |
| s | 1508 | 5.1% |
| r | 1446 | 4.9% |
| c | 844 | 2.9% |
| Other values (39) | 10156 |
Common
| Value | Count | Frequency (%) |
| (space) | 1311 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30861 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2941 | 9.5% |
| a | 2553 | 8.3% |
| o | 2391 | 7.7% |
| l | 2098 | 6.8% |
| i | 2094 | 6.8% |
| n | 1997 | 6.5% |
| t | 1522 | 4.9% |
| s | 1508 | 4.9% |
| r | 1446 | 4.7% |
| (space) | 1311 | 4.2% |
| Other values (40) | 11000 |
State
| Distinct | 47 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| California | |
|---|---|
| New York | |
| Texas | |
| Washington | |
| Pennsylvania | |
| Other values (42) |
Length
| Max length | 20 |
|---|---|
| Median length | 8 |
| Mean length | 8.538949275 |
| Min length | 4 |
Characters and Unicode
| Total characters | 28281 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | North Carolina |
|---|---|
| 2nd row | Pennsylvania |
| 3rd row | Texas |
| 4th row | Illinois |
| 5th row | Florida |
Common Values
| Value | Count | Frequency (%) |
| California | 663 | |
| New York | 352 | 10.6% |
| Texas | 317 | 9.6% |
| Washington | 215 | 6.5% |
| Pennsylvania | 197 | 5.9% |
| Illinois | 172 | 5.2% |
| Ohio | 161 | 4.9% |
| Florida | 126 | 3.8% |
| North Carolina | 85 | 2.6% |
| Tennessee | 81 | 2.4% |
| Other values (37) | 943 |
Words
| Value | Count | Frequency (%) |
| california | 663 | |
| new | 424 | 10.9% |
| york | 352 | 9.1% |
| texas | 317 | 8.2% |
| washington | 215 | 5.6% |
| pennsylvania | 197 | 5.1% |
| illinois | 172 | 4.4% |
| ohio | 161 | 4.2% |
| florida | 126 | 3.3% |
| carolina | 93 | 2.4% |
| Other values (41) | 1153 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3522 | |
| i | 3235 | |
| n | 2820 | 10.0% |
| o | 2493 | 8.8% |
| r | 1764 | 6.2% |
| e | 1715 | 6.1% |
| l | 1596 | 5.6% |
| s | 1578 | 5.6% |
| C | 853 | 3.0% |
| f | 665 | 2.4% |
| Other values (36) | 8040 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23849 | |
| Uppercase Letter | 3871 | 13.7% |
| Space Separator | 561 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3522 | |
| i | 3235 | |
| n | 2820 | |
| o | 2493 | |
| r | 1764 | |
| e | 1715 | |
| l | 1596 | |
| s | 1578 | |
| f | 665 | 2.8% |
| h | 658 | 2.8% |
| Other values (14) | 3803 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 853 | |
| N | 532 | |
| T | 398 | |
| Y | 352 | |
| I | 278 | 7.2% |
| W | 252 | 6.5% |
| M | 247 | 6.4% |
| O | 208 | 5.4% |
| P | 197 | 5.1% |
| F | 126 | 3.3% |
| Other values (11) | 428 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 561 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27720 | |
| Common | 561 | 2.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3522 | |
| i | 3235 | |
| n | 2820 | 10.2% |
| o | 2493 | 9.0% |
| r | 1764 | 6.4% |
| e | 1715 | 6.2% |
| l | 1596 | 5.8% |
| s | 1578 | 5.7% |
| C | 853 | 3.1% |
| f | 665 | 2.4% |
| Other values (35) | 7479 |
Common
| Value | Count | Frequency (%) |
| (space) | 561 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3522 | |
| i | 3235 | |
| n | 2820 | 10.0% |
| o | 2493 | 8.8% |
| r | 1764 | 6.2% |
| e | 1715 | 6.1% |
| l | 1596 | 5.6% |
| s | 1578 | 5.6% |
| C | 853 | 3.0% |
| f | 665 | 2.4% |
| Other values (36) | 8040 |
Postal Code
| Distinct | 437 |
|---|---|
| Distinct (%) | 13.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 56186.5151 |
| Minimum | 1841 |
|---|---|
| Maximum | 99301 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 1841 |
|---|---|
| 5-th percentile | 10009 |
| Q1 | 27978.75 |
| median | 60472.5 |
| Q3 | 90032 |
| 95-th percentile | 98103 |
| Maximum | 99301 |
| Range | 97460 |
| Interquartile range (IQR) | 62053.25 |
Descriptive statistics
| Standard deviation | 31980.37552 |
|---|---|
| Coefficient of variation (CV) | 0.5691824001 |
| Kurtosis | -1.459212056 |
| Mean | 56186.5151 |
| Median Absolute Deviation (MAD) | 29576.5 |
| Skewness | -0.1625508912 |
| Sum | 186089738 |
| Variance | 1022744418 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 10035 | 89 | 2.7% |
| 10009 | 81 | 2.4% |
| 10024 | 73 | 2.2% |
| 98105 | 71 | 2.1% |
| 94122 | 70 | 2.1% |
| 94110 | 67 | 2.0% |
| 10011 | 63 | 1.9% |
| 98103 | 63 | 1.9% |
| 94109 | 53 | 1.6% |
| 19140 | 52 | 1.6% |
| Other values (427) | 2630 |
| Value | Count | Frequency (%) |
| 1841 | 12 | |
| 1852 | 7 | |
| 2038 | 2 | 0.1% |
| 2138 | 2 | 0.1% |
| 2149 | 9 | |
| 2169 | 3 | 0.1% |
| 2740 | 4 | 0.1% |
| 2886 | 3 | 0.1% |
| 2895 | 2 | 0.1% |
| 2908 | 7 |
| Value | Count | Frequency (%) |
| 99301 | 2 | 0.1% |
| 99207 | 4 | 0.1% |
| 98661 | 1 | < 0.1% |
| 98632 | 2 | 0.1% |
| 98502 | 2 | 0.1% |
| 98226 | 3 | 0.1% |
| 98208 | 1 | < 0.1% |
| 98115 | 48 | |
| 98105 | 71 | |
| 98103 | 63 |
Region
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| West | |
|---|---|
| East | |
| Central | |
| South |
Length
| Max length | 7 |
|---|---|
| Median length | 4 |
| Mean length | 4.861111111 |
| Min length | 4 |
Characters and Unicode
| Total characters | 16100 |
|---|---|
| Distinct characters | 14 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | South |
|---|---|
| 2nd row | East |
| 3rd row | Central |
| 4th row | Central |
| 5th row | South |
Common Values
| Value | Count | Frequency (%) |
| West | 1095 | |
| East | 921 | |
| Central | 778 | |
| South | 518 |
Words
| Value | Count | Frequency (%) |
| west | 1095 | |
| east | 921 | |
| central | 778 | |
| south | 518 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 3312 | |
| s | 2016 | |
| e | 1873 | |
| a | 1699 | |
| W | 1095 | 6.8% |
| E | 921 | 5.7% |
| C | 778 | 4.8% |
| n | 778 | 4.8% |
| r | 778 | 4.8% |
| l | 778 | 4.8% |
| Other values (4) | 2072 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12788 | |
| Uppercase Letter | 3312 | 20.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3312 | |
| s | 2016 | |
| e | 1873 | |
| a | 1699 | |
| n | 778 | 6.1% |
| r | 778 | 6.1% |
| l | 778 | 6.1% |
| o | 518 | 4.1% |
| u | 518 | 4.1% |
| h | 518 | 4.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 1095 | |
| E | 921 | |
| C | 778 | |
| S | 518 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16100 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 3312 | |
| s | 2016 | |
| e | 1873 | |
| a | 1699 | |
| W | 1095 | 6.8% |
| E | 921 | 5.7% |
| C | 778 | 4.8% |
| n | 778 | 4.8% |
| r | 778 | 4.8% |
| l | 778 | 4.8% |
| Other values (4) | 2072 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16100 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 3312 | |
| s | 2016 | |
| e | 1873 | |
| a | 1699 | |
| W | 1095 | 6.8% |
| E | 921 | 5.7% |
| C | 778 | 4.8% |
| n | 778 | 4.8% |
| r | 778 | 4.8% |
| l | 778 | 4.8% |
| Other values (4) | 2072 |
| Distinct | 1525 |
|---|---|
| Distinct (%) | 46.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| FUR-CH-10003774 | 8 |
|---|---|
| OFF-ST-10001325 | 7 |
| TEC-AC-10003832 | 7 |
| OFF-BI-10004632 | 7 |
| OFF-PA-10003673 | 7 |
| Other values (1520) |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Characters and Unicode
| Total characters | 49680 |
|---|---|
| Distinct characters | 27 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 554 |
|---|---|
| Unique (%) | 16.7% |
Sample
| 1st row | OFF-PA-10002365 |
|---|---|
| 2nd row | FUR-CH-10002774 |
| 3rd row | OFF-PA-10000249 |
| 4th row | TEC-PH-10004093 |
| 5th row | OFF-ST-10003282 |
Common Values
| Value | Count | Frequency (%) |
| FUR-CH-10003774 | 8 | 0.2% |
| OFF-ST-10001325 | 7 | 0.2% |
| TEC-AC-10003832 | 7 | 0.2% |
| OFF-BI-10004632 | 7 | 0.2% |
| OFF-PA-10003673 | 7 | 0.2% |
| OFF-ST-10003208 | 7 | 0.2% |
| OFF-PA-10001970 | 7 | 0.2% |
| TEC-AC-10004510 | 7 | 0.2% |
| OFF-BI-10003274 | 6 | 0.2% |
| FUR-TA-10001520 | 6 | 0.2% |
| Other values (1515) | 3243 |
Length
| Value | Count | Frequency (%) |
| fur-ch-10003774 | 8 | 0.2% |
| off-st-10003208 | 7 | 0.2% |
| tec-ac-10004510 | 7 | 0.2% |
| off-pa-10001970 | 7 | 0.2% |
| off-st-10001325 | 7 | 0.2% |
| off-pa-10003673 | 7 | 0.2% |
| tec-ac-10003832 | 7 | 0.2% |
| off-bi-10004632 | 7 | 0.2% |
| off-bi-10002012 | 6 | 0.2% |
| off-st-10000615 | 6 | 0.2% |
| Other values (1515) | 3243 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 11602 | |
| - | 6624 | |
| F | 5070 | |
| 1 | 5029 | |
| O | 2100 | 4.2% |
| 4 | 1644 | 3.3% |
| 3 | 1608 | 3.2% |
| 2 | 1598 | 3.2% |
| A | 1496 | 3.0% |
| C | 1111 | 2.2% |
| Other values (17) | 11798 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26496 | |
| Uppercase Letter | 16560 | |
| Dash Punctuation | 6624 | 13.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 5070 | |
| O | 2100 | |
| A | 1496 | 9.0% |
| C | 1111 | 6.7% |
| U | 1061 | 6.4% |
| T | 1016 | 6.1% |
| R | 968 | 5.8% |
| P | 918 | 5.5% |
| E | 695 | 4.2% |
| B | 576 | 3.5% |
| Other values (6) | 1549 | 9.4% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 11602 | |
| 1 | 5029 | |
| 4 | 1644 | 6.2% |
| 3 | 1608 | 6.1% |
| 2 | 1598 | 6.0% |
| 5 | 1094 | 4.1% |
| 7 | 1018 | 3.8% |
| 9 | 978 | 3.7% |
| 6 | 976 | 3.7% |
| 8 | 949 | 3.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6624 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 33120 | |
| Latin | 16560 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| F | 5070 | |
| O | 2100 | |
| A | 1496 | 9.0% |
| C | 1111 | 6.7% |
| U | 1061 | 6.4% |
| T | 1016 | 6.1% |
| R | 968 | 5.8% |
| P | 918 | 5.5% |
| E | 695 | 4.2% |
| B | 576 | 3.5% |
| Other values (6) | 1549 | 9.4% |
Common
| Value | Count | Frequency (%) |
| 0 | 11602 | |
| - | 6624 | |
| 1 | 5029 | |
| 4 | 1644 | 5.0% |
| 3 | 1608 | 4.9% |
| 2 | 1598 | 4.8% |
| 5 | 1094 | 3.3% |
| 7 | 1018 | 3.1% |
| 9 | 978 | 3.0% |
| 6 | 976 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 49680 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 11602 | |
| - | 6624 | |
| F | 5070 | |
| 1 | 5029 | |
| O | 2100 | 4.2% |
| 4 | 1644 | 3.3% |
| 3 | 1608 | 3.2% |
| 2 | 1598 | 3.2% |
| A | 1496 | 3.0% |
| C | 1111 | 2.2% |
| Other values (17) | 11798 |
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Office Supplies | |
|---|---|
| Furniture | |
| Technology |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.81521739 |
| Min length | 9 |
Characters and Unicode
| Total characters | 42444 |
|---|---|
| Distinct characters | 20 |
| Distinct categories | 3 |
| Distinct scripts | 2 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Office Supplies |
|---|---|
| 2nd row | Furniture |
| 3rd row | Office Supplies |
| 4th row | Technology |
| 5th row | Office Supplies |
Common Values
| Value | Count | Frequency (%) |
| Office Supplies | 2002 | |
| Furniture | 686 | 20.7% |
| Technology | 624 | 18.8% |
Length
Pie chart
| Value | Count | Frequency (%) |
| office | 2002 | |
| supplies | 2002 | |
| furniture | 686 | 12.9% |
| technology | 624 | 11.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 5314 | |
| i | 4690 | |
| p | 4004 | |
| f | 4004 | |
| u | 3374 | 7.9% |
| c | 2626 | 6.2% |
| l | 2626 | 6.2% |
| O | 2002 | 4.7% |
| s | 2002 | 4.7% |
| S | 2002 | 4.7% |
| Other values (10) | 9800 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 35128 | |
| Uppercase Letter | 5314 | 12.5% |
| Space Separator | 2002 | 4.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 5314 | |
| i | 4690 | |
| p | 4004 | |
| f | 4004 | |
| u | 3374 | |
| c | 2626 | |
| l | 2626 | |
| s | 2002 | 5.7% |
| r | 1372 | 3.9% |
| n | 1310 | 3.7% |
| Other values (5) | 3806 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2002 | |
| S | 2002 | |
| F | 686 | 12.9% |
| T | 624 | 11.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2002 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 40442 | |
| Common | 2002 | 4.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 5314 | |
| i | 4690 | |
| p | 4004 | |
| f | 4004 | |
| u | 3374 | |
| c | 2626 | 6.5% |
| l | 2626 | 6.5% |
| O | 2002 | 5.0% |
| s | 2002 | 5.0% |
| S | 2002 | 5.0% |
| Other values (9) | 7798 |
Common
| Value | Count | Frequency (%) |
| (space) | 2002 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42444 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 5314 | |
| i | 4690 | |
| p | 4004 | |
| f | 4004 | |
| u | 3374 | 7.9% |
| c | 2626 | 6.2% |
| l | 2626 | 6.2% |
| O | 2002 | 4.7% |
| s | 2002 | 4.7% |
| S | 2002 | 4.7% |
| Other values (10) | 9800 |
| Distinct | 17 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Binders | |
|---|---|
| Paper | |
| Furnishings | |
| Phones | |
| Storage | |
| Other values (12) |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.188707729 |
| Min length | 3 |
Characters and Unicode
| Total characters | 23809 |
|---|---|
| Distinct characters | 28 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Paper |
|---|---|
| 2nd row | Chairs |
| 3rd row | Paper |
| 4th row | Phones |
| 5th row | Storage |
Common Values
| Value | Count | Frequency (%) |
| Binders | 500 | |
| Paper | 459 | |
| Furnishings | 316 | |
| Phones | 294 | |
| Storage | 288 | |
| Art | 282 | |
| Accessories | 275 | |
| Chairs | 190 | 5.7% |
| Appliances | 165 | 5.0% |
| Labels | 114 | 3.4% |
| Other values (7) | 429 |
Length
| Value | Count | Frequency (%) |
| binders | 500 | |
| paper | 459 | |
| furnishings | 316 | |
| phones | 294 | |
| storage | 288 | |
| art | 282 | |
| accessories | 275 | |
| chairs | 190 | 5.7% |
| appliances | 165 | 5.0% |
| labels | 114 | 3.4% |
| Other values (7) | 429 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 3289 | |
| e | 2934 | |
| r | 2396 | 10.1% |
| i | 1876 | 7.9% |
| n | 1759 | 7.4% |
| a | 1493 | 6.3% |
| o | 1102 | 4.6% |
| p | 1000 | 4.2% |
| h | 833 | 3.5% |
| c | 824 | 3.5% |
| Other values (18) | 6303 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20497 | |
| Uppercase Letter | 3312 | 13.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 3289 | |
| e | 2934 | |
| r | 2396 | |
| i | 1876 | |
| n | 1759 | |
| a | 1493 | |
| o | 1102 | 5.4% |
| p | 1000 | 4.9% |
| h | 833 | 4.1% |
| c | 824 | 4.0% |
| Other values (8) | 2991 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 753 | |
| A | 722 | |
| B | 576 | |
| F | 380 | |
| S | 347 | |
| C | 212 | 6.4% |
| L | 114 | 3.4% |
| T | 104 | 3.1% |
| E | 71 | 2.1% |
| M | 33 | 1.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23809 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 3289 | |
| e | 2934 | |
| r | 2396 | 10.1% |
| i | 1876 | 7.9% |
| n | 1759 | 7.4% |
| a | 1493 | 6.3% |
| o | 1102 | 4.6% |
| p | 1000 | 4.2% |
| h | 833 | 3.5% |
| c | 824 | 3.5% |
| Other values (18) | 6303 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23809 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 3289 | |
| e | 2934 | |
| r | 2396 | 10.1% |
| i | 1876 | 7.9% |
| n | 1759 | 7.4% |
| a | 1493 | 6.3% |
| o | 1102 | 4.6% |
| p | 1000 | 4.2% |
| h | 833 | 3.5% |
| c | 824 | 3.5% |
| Other values (18) | 6303 |
| Distinct | 1511 |
|---|---|
| Distinct (%) | 45.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| Easy-staple paper | 16 |
|---|---|
| Staples | 15 |
| Staples in misc. colors | 12 |
| Staple envelope | 11 |
| Storex Dura Pro Binders | 8 |
| Other values (1506) |
Length
| Max length | 127 |
|---|---|
| Median length | 36 |
| Mean length | 37.0513285 |
| Min length | 5 |
Characters and Unicode
| Total characters | 122714 |
|---|---|
| Distinct characters | 84 |
| Distinct categories | 12 |
| Distinct scripts | 2 |
| Distinct blocks | 3 |
Unique
| Unique | 554 |
|---|---|
| Unique (%) | 16.7% |
Sample
| 1st row | Xerox 1967 |
|---|---|
| 2nd row | Global Deluxe Stacking Chair, Gray |
| 3rd row | Easy-staple paper |
| 4th row | Panasonic Kx-TS550 |
| 5th row | Advantus 10-Drawer Portable Organizer, Chrome Metal Frame, Smoke Drawers |
Common Values
| Value | Count | Frequency (%) |
| Easy-staple paper | 16 | 0.5% |
| Staples | 15 | 0.5% |
| Staples in misc. colors | 12 | 0.4% |
| Staple envelope | 11 | 0.3% |
| Storex Dura Pro Binders | 8 | 0.2% |
| Global Wood Trimmed Manager's Task Chair, Khaki | 8 | 0.2% |
| Staple remover | 8 | 0.2% |
| Logitech Desktop MK120 Mouse and keyboard Combo | 7 | 0.2% |
| Adjustable Depth Letter/Legal Cart | 7 | 0.2% |
| Sterilite Officeware Hinged File Box | 7 | 0.2% |
| Other values (1501) | 3213 |
Length
| Value | Count | Frequency (%) |
| xerox | 292 | 1.6% |
| x | 220 | 1.2% |
| | 202 | 1.1% |
| with | 190 | 1.0% |
| for | 183 | 1.0% |
| binders | 178 | 1.0% |
| avery | 173 | 0.9% |
| chair | 151 | 0.8% |
| black | 147 | 0.8% |
| phone | 114 | 0.6% |
| Other values (2479) | 16721 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 15134 | 12.3% |
| e | 11264 | 9.2% |
| r | 6984 | 5.7% |
| o | 6626 | 5.4% |
| a | 6392 | 5.2% |
| i | 6207 | 5.1% |
| l | 5439 | 4.4% |
| n | 5044 | 4.1% |
| s | 4940 | 4.0% |
| t | 4800 | 3.9% |
| Other values (74) | 49884 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 79386 | |
| Uppercase Letter | 18619 | 15.2% |
| Space Separator | 15275 | 12.4% |
| Decimal Number | 5948 | 4.8% |
| Other Punctuation | 2415 | 2.0% |
| Dash Punctuation | 985 | 0.8% |
| Final Punctuation | 24 | < 0.1% |
| Open Punctuation | 21 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Math Symbol | 9 | < 0.1% |
| Other values (2) | 11 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11264 | |
| r | 6984 | 8.8% |
| o | 6626 | 8.3% |
| a | 6392 | 8.1% |
| i | 6207 | 7.8% |
| l | 5439 | 6.9% |
| n | 5044 | 6.4% |
| s | 4940 | 6.2% |
| t | 4800 | 6.0% |
| c | 2950 | 3.7% |
| Other values (18) | 18740 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 2071 | 11.1% |
| C | 2026 | 10.9% |
| B | 1851 | 9.9% |
| P | 1616 | 8.7% |
| M | 1037 | 5.6% |
| D | 985 | 5.3% |
| A | 925 | 5.0% |
| T | 878 | 4.7% |
| F | 878 | 4.7% |
| L | 747 | 4.0% |
| Other values (16) | 5605 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1054 | |
| / | 557 | |
| " | 412 | 17.1% |
| . | 177 | 7.3% |
| & | 90 | 3.7% |
| ' | 76 | 3.1% |
| # | 28 | 1.2% |
| % | 12 | 0.5% |
| ! | 5 | 0.2% |
| ; | 2 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1216 | |
| 0 | 977 | |
| 2 | 769 | |
| 4 | 568 | |
| 3 | 513 | |
| 5 | 477 | 8.0% |
| 8 | 417 | 7.0% |
| 9 | 397 | 6.7% |
| 6 | 318 | 5.3% |
| 7 | 296 | 5.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 15134 | |
| (no-break space) | 141 | 0.9% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 985 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 24 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 9 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 8 |
Other Number
| Value | Count | Frequency (%) |
| ¾ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 98005 | |
| Common | 24709 | 20.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11264 | 11.5% |
| r | 6984 | 7.1% |
| o | 6626 | 6.8% |
| a | 6392 | 6.5% |
| i | 6207 | 6.3% |
| l | 5439 | 5.5% |
| n | 5044 | 5.1% |
| s | 4940 | 5.0% |
| t | 4800 | 4.9% |
| c | 2950 | 3.0% |
| Other values (44) | 37359 |
Common
| Value | Count | Frequency (%) |
| (space) | 15134 | |
| 1 | 1216 | 4.9% |
| , | 1054 | 4.3% |
| - | 985 | 4.0% |
| 0 | 977 | 4.0% |
| 2 | 769 | 3.1% |
| 4 | 568 | 2.3% |
| / | 557 | 2.3% |
| 3 | 513 | 2.1% |
| 5 | 477 | 1.9% |
| Other values (20) | 2459 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 122531 | |
| None | 151 | 0.1% |
| Punctuation | 32 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 15134 | 12.4% |
| e | 11264 | 9.2% |
| r | 6984 | 5.7% |
| o | 6626 | 5.4% |
| a | 6392 | 5.2% |
| i | 6207 | 5.1% |
| l | 5439 | 4.4% |
| n | 5044 | 4.1% |
| s | 4940 | 4.0% |
| t | 4800 | 3.9% |
| Other values (68) | 49701 |
None
| Value | Count | Frequency (%) |
| (no-break space) | 141 | |
| é | 6 | 4.0% |
| ¾ | 3 | 2.0% |
| à | 1 | 0.7% |
Punctuation
| Value | Count | Frequency (%) |
| ” | 24 | |
| “ | 8 | 25.0% |
| Distinct | 2623 |
|---|---|
| Distinct (%) | 79.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 221.3814176 |
| Minimum | 0.444 |
|---|---|
| Maximum | 13999.96 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 0.444 |
|---|---|
| 5-th percentile | 5.106 |
| Q1 | 17.018 |
| median | 53.81 |
| Q3 | 205.1057 |
| 95-th percentile | 907.643 |
| Maximum | 13999.96 |
| Range | 13999.516 |
| Interquartile range (IQR) | 188.0877 |
Descriptive statistics
| Standard deviation | 585.2575313 |
|---|---|
| Coefficient of variation (CV) | 2.643661503 |
| Kurtosis | 179.3055029 |
| Mean | 221.3814176 |
| Median Absolute Deviation (MAD) | 44.488 |
| Skewness | 10.55472573 |
| Sum | 733215.2552 |
| Variance | 342526.3779 |
| Monotonicity | Not monotonic |
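Several rows in the quantile and descriptive tables above are derived from other rows. The sketch below recomputes them from the reported figures, taking the 3312 observations from the dataset overview. The variable names are illustrative and are not taken from the profiling library.

```python
from math import isclose

# Values copied from the tables above; the count comes from the dataset overview.
n_observations = 3312
minimum, maximum = 0.444, 13999.96
q1, q3 = 17.018, 205.1057
mean, std_dev = 221.3814176, 585.2575313
total = 733215.2552

value_range = maximum - minimum          # Range = Maximum - Minimum
iqr = q3 - q1                            # Interquartile range = Q3 - Q1
cv = std_dev / mean                      # Coefficient of variation = std / mean
variance = std_dev ** 2                  # Variance = std squared
mean_from_sum = total / n_observations   # Mean = Sum / number of observations

# Each derived value should agree with the corresponding reported row.
print(isclose(value_range, 13999.516, rel_tol=1e-9))
print(isclose(iqr, 188.0877, rel_tol=1e-9))
print(isclose(cv, 2.643661503, rel_tol=1e-6))
print(isclose(variance, 342526.3779, rel_tol=1e-6))
print(isclose(mean_from_sum, mean, rel_tol=1e-6))
```

A coefficient of variation well above 1, as here, indicates that the spread is large relative to the mean, which is consistent with the strong right skew reported in the same table.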
| Value | Count | Frequency (%) |
| 12.96 | 20 | 0.6% |
| 15.552 | 14 | 0.4% |
| 19.44 | 13 | 0.4% |
| 20.736 | 12 | 0.4% |
| 10.368 | 11 | 0.3% |
| 25.92 | 9 | 0.3% |
| 32.4 | 6 | 0.2% |
| 18.24 | 6 | 0.2% |
| 6.48 | 5 | 0.2% |
| 8.64 | 5 | 0.2% |
| Other values (2613) | 3211 |
| Value | Count | Frequency (%) |
| 0.444 | 1 | |
| 0.556 | 1 | |
| 0.99 | 1 | |
| 1.08 | 1 | |
| 1.188 | 1 | |
| 1.188 | 2 | |
| 1.248 | 2 | |
| 1.392 | 1 | |
| 1.408 | 1 | |
| 1.44 | 1 |
| Value | Count | Frequency (%) |
| 13999.96 | 1 | |
| 11199.968 | 1 | |
| 10499.97 | 1 | |
| 7999.98 | 1 | |
| 5443.96 | 1 | |
| 5199.96 | 1 | |
| 5083.96 | 1 | |
| 4799.984 | 1 | |
| 4663.736 | 1 | |
| 4416.174 | 1 |
Quantity
Real number (ℝ≥0)
| Distinct | 14 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.766908213 |
| Minimum | 1 |
|---|---|
| Maximum | 14 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 8 |
| Maximum | 14 |
| Range | 13 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.221776109 |
|---|---|
| Coefficient of variation (CV) | 0.5898142412 |
| Kurtosis | 1.793741495 |
| Mean | 3.766908213 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.223969076 |
| Sum | 12476 |
| Variance | 4.936289079 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 781 | |
| 3 | 759 | |
| 5 | 441 | |
| 4 | 398 | |
| 1 | 337 | |
| 7 | 190 | 5.7% |
| 6 | 173 | 5.2% |
| 8 | 99 | 3.0% |
| 9 | 80 | 2.4% |
| 10 | 18 | 0.5% |
| Other values (4) | 36 | 1.1% |
| Value | Count | Frequency (%) |
| 1 | 337 | |
| 2 | 781 | |
| 3 | 759 | |
| 4 | 398 | |
| 5 | 441 | |
| 6 | 173 | 5.2% |
| 7 | 190 | 5.7% |
| 8 | 99 | 3.0% |
| 9 | 80 | 2.4% |
| 10 | 18 | 0.5% |
| Value | Count | Frequency (%) |
| 14 | 8 | 0.2% |
| 13 | 8 | 0.2% |
| 12 | 7 | 0.2% |
| 11 | 13 | 0.4% |
| 10 | 18 | 0.5% |
| 9 | 80 | 2.4% |
| 8 | 99 | 3.0% |
| 7 | 190 | |
| 6 | 173 | 5.2% |
| 5 | 441 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1564673913 |
| Minimum | 0 |
|---|---|
| Maximum | 0.8 |
| Zeros | 1590 |
| Zeros (%) | 48.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0.2 |
| Q3 | 0.2 |
| 95-th percentile | 0.7 |
| Maximum | 0.8 |
| Range | 0.8 |
| Interquartile range (IQR) | 0.2 |
Descriptive statistics
| Standard deviation | 0.2074291213 |
|---|---|
| Coefficient of variation (CV) | 1.325701922 |
| Kurtosis | 2.461983121 |
| Mean | 0.1564673913 |
| Median Absolute Deviation (MAD) | 0.2 |
| Skewness | 1.699831702 |
| Sum | 518.22 |
| Variance | 0.04302684037 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 1590 | |
| 0.2 | 1223 | |
| 0.7 | 138 | 4.2% |
| 0.8 | 107 | 3.2% |
| 0.4 | 69 | 2.1% |
| 0.3 | 68 | 2.1% |
| 0.6 | 39 | 1.2% |
| 0.1 | 28 | 0.8% |
| 0.5 | 19 | 0.6% |
| 0.15 | 16 | 0.5% |
| Other values (2) | 15 | 0.5% |
| Value | Count | Frequency (%) |
| 0 | 1590 | |
| 0.1 | 28 | 0.8% |
| 0.15 | 16 | 0.5% |
| 0.2 | 1223 | |
| 0.3 | 68 | 2.1% |
| 0.32 | 11 | 0.3% |
| 0.4 | 69 | 2.1% |
| 0.45 | 4 | 0.1% |
| 0.5 | 19 | 0.6% |
| 0.6 | 39 | 1.2% |
| Value | Count | Frequency (%) |
| 0.8 | 107 | 3.2% |
| 0.7 | 138 | 4.2% |
| 0.6 | 39 | 1.2% |
| 0.5 | 19 | 0.6% |
| 0.45 | 4 | 0.1% |
| 0.4 | 69 | 2.1% |
| 0.32 | 11 | 0.3% |
| 0.3 | 68 | 2.1% |
| 0.2 | 1223 | |
| 0.15 | 16 | 0.5% |
| Distinct | 2913 |
|---|---|
| Distinct (%) | 88.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.21233986 |
| Minimum | -3839.9904 |
|---|---|
| Maximum | 6719.9808 |
| Zeros | 19 |
| Zeros (%) | 0.6% |
| Negative | 620 |
| Negative (%) | 18.7% |
| Memory size | 26.0 KiB |
Quantile statistics
| Minimum | -3839.9904 |
|---|---|
| 5-th percentile | -52.39104 |
| Q1 | 1.7632 |
| median | 8.2968 |
| Q3 | 28.315125 |
| 95-th percentile | 163.80095 |
| Maximum | 6719.9808 |
| Range | 10559.9712 |
| Interquartile range (IQR) | 26.551925 |
Descriptive statistics
| Standard deviation | 241.8643416 |
|---|---|
| Coefficient of variation (CV) | 8.5729983 |
| Kurtosis | 300.4536849 |
| Mean | 28.21233986 |
| Median Absolute Deviation (MAD) | 10.7688 |
| Skewness | 8.217176714 |
| Sum | 93439.2696 |
| Variance | 58498.35974 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 19 | 0.6% |
| 6.2208 | 16 | 0.5% |
| 9.3312 | 13 | 0.4% |
| 7.2576 | 12 | 0.4% |
| 5.4432 | 11 | 0.3% |
| 3.6288 | 9 | 0.3% |
| 12.4416 | 7 | 0.2% |
| 15.552 | 6 | 0.2% |
| 114.9385 | 4 | 0.1% |
| 9.072 | 4 | 0.1% |
| Other values (2903) | 3211 |
| Value | Count | Frequency (%) |
| -3839.9904 | 1 | |
| -3399.98 | 1 | |
| -2929.4845 | 1 | |
| -2287.782 | 1 | |
| -1306.5504 | 1 | |
| -1237.8462 | 1 | |
| -1143.891 | 1 | |
| -1141.47 | 1 | |
| -1049.3406 | 1 | |
| -1002.7836 | 1 |
| Value | Count | Frequency (%) |
| 6719.9808 | 1 | |
| 5039.9856 | 1 | |
| 3919.9888 | 1 | |
| 2504.2216 | 1 | |
| 1906.485 | 1 | |
| 1668.205 | 1 | |
| 1453.1238 | 1 | |
| 1439.976 | 1 | |
| 1379.977 | 1 | |
| 1351.9896 | 1 |
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 26.0 KiB |
| 2020-12 | |
|---|---|
| 2020-09 | |
| 2020-11 | |
| 2020-10 | |
| 2020-06 | |
| Other values (7) |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Characters and Unicode
| Total characters | 23184 |
|---|---|
| Distinct characters | 11 |
| Distinct categories | 2 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2020-04 |
|---|---|
| 2nd row | 2020-07 |
| 3rd row | 2020-10 |
| 4th row | 2020-09 |
| 5th row | 2020-09 |
Common Values
| Value | Count | Frequency (%) |
| 2020-12 | 462 | |
| 2020-09 | 459 | |
| 2020-11 | 459 | |
| 2020-10 | 298 | |
| 2020-06 | 245 | |
| 2020-05 | 242 | |
| 2020-03 | 238 | |
| 2020-07 | 226 | |
| 2020-08 | 218 | |
| 2020-04 | 203 | |
| Other values (2) | 262 |
Length
| Value | Count | Frequency (%) |
| 2020-12 | 462 | |
| 2020-09 | 459 | |
| 2020-11 | 459 | |
| 2020-10 | 298 | |
| 2020-06 | 245 | |
| 2020-05 | 242 | |
| 2020-03 | 238 | |
| 2020-07 | 226 | |
| 2020-08 | 218 | |
| 2020-04 | 203 | |
| Other values (2) | 262 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 9015 | |
| 2 | 7193 | |
| - | 3312 | 14.3% |
| 1 | 1833 | 7.9% |
| 9 | 459 | 2.0% |
| 6 | 245 | 1.1% |
| 5 | 242 | 1.0% |
| 3 | 238 | 1.0% |
| 7 | 226 | 1.0% |
| 8 | 218 | 0.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19872 | |
| Dash Punctuation | 3312 | 14.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 9015 | |
| 2 | 7193 | |
| 1 | 1833 | 9.2% |
| 9 | 459 | 2.3% |
| 6 | 245 | 1.2% |
| 5 | 242 | 1.2% |
| 3 | 238 | 1.2% |
| 7 | 226 | 1.1% |
| 8 | 218 | 1.1% |
| 4 | 203 | 1.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3312 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 23184 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 9015 | |
| 2 | 7193 | |
| - | 3312 | 14.3% |
| 1 | 1833 | 7.9% |
| 9 | 459 | 2.0% |
| 6 | 245 | 1.1% |
| 5 | 242 | 1.0% |
| 3 | 238 | 1.0% |
| 7 | 226 | 1.0% |
| 8 | 218 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23184 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 9015 | |
| 2 | 7193 | |
| - | 3312 | 14.3% |
| 1 | 1833 | 7.9% |
| 9 | 459 | 2.0% |
| 6 | 245 | 1.1% |
| 5 | 242 | 1.0% |
| 3 | 238 | 1.0% |
| 7 | 226 | 1.0% |
| 8 | 218 | 0.9% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, so it is better than Pearson's r at catching nonlinear monotonic relationships. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
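The calculation described above can be sketched in plain Python: rank both variables, then apply the covariance-over-standard-deviations formula to the ranks. The helper names (`average_ranks`, `pearson_r`, `spearman_rho`) are illustrative, and assigning tied values their average rank is an assumption of this sketch rather than something the report states.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        mean_rank = (start + end) / 2 + 1
        for k in range(start, end + 1):
            ranks[order[k]] = mean_rank
        start = end + 1
    return ranks

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

def spearman_rho(x, y):
    """Spearman's rho: Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# A monotonic but nonlinear relationship: y grows as the cube of x.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)
r = pearson_r(x, y)
print(rho)      # 1.0, because the ordering is preserved exactly
print(r < rho)  # True, because the relationship is not a straight line
```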
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, so for a linear relationship the steepness of the line does not affect r; only its direction does. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
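As a minimal sketch of the formula and of the invariance property, the snippet below computes r for the Sales and Profit values of the first five rows shown in the sample at the end of this report, then again after shifting and rescaling both variables. Five rows are only an illustration, so the result is not the coefficient the report computes for the full dataset. The function name `pearson_r` and the shift and scale constants are arbitrary choices for this example.

```python
from math import sqrt

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Sales and Profit from the first five sample rows of the report.
sales = [15.552, 71.372, 29.472, 147.168, 95.616]
profit = [5.4432, -1.0196, 9.9468, 16.5564, 9.5616]

r_original = pearson_r(sales, profit)

# Separate changes of location and (positive) scale leave r unchanged.
sales_rescaled = [100 * s + 7 for s in sales]
profit_rescaled = [0.5 * p - 3 for p in profit]
r_rescaled = pearson_r(sales_rescaled, profit_rescaled)

print(abs(r_original - r_rescaled) < 1e-9)  # True
```

Multiplying one variable by a negative constant would flip the sign of r while leaving its magnitude unchanged.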
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
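The pair-counting rule above can be sketched directly. This is the plain form the paragraph describes (often called tau-a); it applies no correction for ties, which the report's own computation may handle differently. The function name `kendall_tau_a` is illustrative, and the data are the Sales and Profit values of the first five sample rows shown at the end of this report.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
        # direction == 0 means a tie; it counts toward neither group
    total_pairs = len(x) * (len(x) - 1) // 2
    return (concordant - discordant) / total_pairs

# Sales and Profit from the first five sample rows of the report.
sales = [15.552, 71.372, 29.472, 147.168, 95.616]
profit = [5.4432, -1.0196, 9.9468, 16.5564, 9.5616]

tau = kendall_tau_a(sales, profit)
print(tau)  # 0.4: 7 concordant and 3 discordant pairs out of 10
```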
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use the bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for it.
First rows
| | Row ID | Order ID | Order Date | Ship Mode | Customer ID | Customer Name | Segment | Country | City | State | Postal Code | Region | Product ID | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | month_year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 13 | CA-2017-114412 | 2020-04-15 | Standard Class | AA-10480 | Andrew Allen | Consumer | United States | Concord | North Carolina | 28027 | South | OFF-PA-10002365 | Office Supplies | Paper | Xerox 1967 | 15.552 | 3 | 0.2 | 5.4432 | 2020-04 |
| 1 | 24 | US-2017-156909 | 2020-07-16 | Second Class | SF-20065 | Sandra Flanagan | Consumer | United States | Philadelphia | Pennsylvania | 19140 | East | FUR-CH-10002774 | Furniture | Chairs | Global Deluxe Stacking Chair, Gray | 71.372 | 2 | 0.3 | -1.0196 | 2020-07 |
| 2 | 35 | CA-2017-107727 | 2020-10-19 | Second Class | MA-17560 | Matt Abelman | Home Office | United States | Houston | Texas | 77095 | Central | OFF-PA-10000249 | Office Supplies | Paper | Easy-staple paper | 29.472 | 3 | 0.2 | 9.9468 | 2020-10 |
| 3 | 42 | CA-2017-120999 | 2020-09-10 | Standard Class | LC-16930 | Linda Cazamias | Corporate | United States | Naperville | Illinois | 60540 | Central | TEC-PH-10004093 | Technology | Phones | Panasonic Kx-TS550 | 147.168 | 4 | 0.2 | 16.5564 | 2020-09 |
| 4 | 44 | CA-2017-139619 | 2020-09-19 | Standard Class | ES-14080 | Erin Smith | Corporate | United States | Melbourne | Florida | 32935 | South | OFF-ST-10003282 | Office Supplies | Storage | Advantus 10-Drawer Portable Organizer, Chrome Metal Frame, Smoke Drawers | 95.616 | 2 | 0.2 | 9.5616 | 2020-09 |
| 5 | 72 | CA-2017-114440 | 2020-09-14 | Second Class | TB-21520 | Tracy Blumstein | Consumer | United States | Jackson | Michigan | 49201 | Central | OFF-PA-10004675 | Office Supplies | Paper | Telephone Message Books with Fax/Mobile Section, 5 1/2" x 3 3/16" | 19.050 | 3 | 0.0 | 8.7630 | 2020-09 |
| 6 | 76 | US-2017-118038 | 2020-12-09 | First Class | KB-16600 | Ken Brennan | Corporate | United States | Houston | Texas | 77041 | Central | OFF-BI-10004182 | Office Supplies | Binders | Economy Binders | 1.248 | 3 | 0.8 | -1.9344 | 2020-12 |
| 7 | 77 | US-2017-118038 | 2020-12-09 | First Class | KB-16600 | Ken Brennan | Corporate | United States | Houston | Texas | 77041 | Central | FUR-FU-10000260 | Furniture | Furnishings | 6" Cubicle Wall Clock, Black | 9.708 | 3 | 0.6 | -5.8248 | 2020-12 |
| 8 | 78 | US-2017-118038 | 2020-12-09 | First Class | KB-16600 | Ken Brennan | Corporate | United States | Houston | Texas | 77041 | Central | OFF-ST-10000615 | Office Supplies | Storage | SimpliFile Personal File, Black Granite, 15w x 6-15/16d x 11-1/4h | 27.240 | 3 | 0.2 | 2.7240 | 2020-12 |
| 9 | 85 | US-2017-119662 | 2020-11-13 | First Class | CS-12400 | Christopher Schild | Home Office | United States | Chicago | Illinois | 60623 | Central | OFF-ST-10003656 | Office Supplies | Storage | Safco Industrial Wire Shelving | 230.376 | 3 | 0.2 | -48.9549 | 2020-11 |
Last rows
| | Row ID | Order ID | Order Date | Ship Mode | Customer ID | Customer Name | Segment | Country | City | State | Postal Code | Region | Product ID | Category | Sub-Category | Product Name | Sales | Quantity | Discount | Profit | month_year |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3302 | 9968 | CA-2017-153871 | 2020-12-11 | Standard Class | RB-19435 | Richard Bierner | Consumer | United States | Plainfield | New Jersey | 7060 | East | OFF-BI-10004209 | Office Supplies | Binders | Fellowes Twister Kit, Gray/Clear, 3/pkg | 40.200 | 5 | 0.0 | 18.0900 | 2020-12 |
| 3303 | 9969 | CA-2017-153871 | 2020-12-11 | Standard Class | RB-19435 | Richard Bierner | Consumer | United States | Plainfield | New Jersey | 7060 | East | OFF-BI-10004600 | Office Supplies | Binders | Ibico Ibimaster 300 Manual Binding System | 735.980 | 2 | 0.0 | 331.1910 | 2020-12 |
| 3304 | 9970 | CA-2017-153871 | 2020-12-11 | Standard Class | RB-19435 | Richard Bierner | Consumer | United States | Plainfield | New Jersey | 7060 | East | OFF-AP-10003622 | Office Supplies | Appliances | Bravo II Megaboss 12-Amp Hard Body Upright, Replacement Belts, 2 Belts per Pack | 22.750 | 7 | 0.0 | 6.5975 | 2020-12 |
| 3305 | 9982 | CA-2017-163566 | 2020-08-03 | First Class | TB-21055 | Ted Butterfield | Consumer | United States | Fairfield | Ohio | 45014 | East | OFF-LA-10004484 | Office Supplies | Labels | Avery 476 | 16.520 | 5 | 0.2 | 5.3690 | 2020-08 |
| 3306 | 9988 | CA-2017-163629 | 2020-11-17 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-AC-10001539 | Technology | Accessories | Logitech G430 Surround Sound Gaming Headset with Dolby 7.1 Technology | 79.990 | 1 | 0.0 | 28.7964 | 2020-11 |
| 3307 | 9989 | CA-2017-163629 | 2020-11-17 | Standard Class | RA-19885 | Ruben Ausman | Corporate | United States | Athens | Georgia | 30605 | South | TEC-PH-10004006 | Technology | Phones | Panasonic KX - TS880B Telephone | 206.100 | 5 | 0.0 | 55.6470 | 2020-11 |
| 3308 | 9991 | CA-2017-121258 | 2020-02-26 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | FUR-FU-10000747 | Furniture | Furnishings | Tenex B1-RE Series Chair Mats for Low Pile Carpets | 91.960 | 2 | 0.0 | 15.6332 | 2020-02 |
| 3309 | 9992 | CA-2017-121258 | 2020-02-26 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | TEC-PH-10003645 | Technology | Phones | Aastra 57i VoIP phone | 258.576 | 2 | 0.2 | 19.3932 | 2020-02 |
| 3310 | 9993 | CA-2017-121258 | 2020-02-26 | Standard Class | DB-13060 | Dave Brooks | Consumer | United States | Costa Mesa | California | 92627 | West | OFF-PA-10004041 | Office Supplies | Paper | It's Hot Message Books with Stickers, 2 3/4" x 5" | 29.600 | 4 | 0.0 | 13.3200 | 2020-02 |
| 3311 | 9994 | CA-2017-119914 | 2020-05-04 | Second Class | CC-12220 | Chris Cortes | Consumer | United States | Westminster | California | 92683 | West | OFF-AP-10002684 | Office Supplies | Appliances | Acco 7-Outlet Masterpiece Power Center, Wihtout Fax/Phone Line Protection | 243.160 | 2 | 0.0 | 72.9480 | 2020-05 |